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Abstract 

Let T be a rooted and weighted tree, where the weight of any node is equal to 
the sum of the weights of its children. The popular Treemap algorithm visualizes such 
a tree as a hierarchical partition of a square into rectangles, where the area of the 
rectangle corresponding to any node in T is equal to the weight of that node. The 
aspect ratio of the rectangles in such a rectangular partition necessarily depends on 
the weights and can become arbitrarily high. 

We introduce a new hierarchical partition scheme, called a polygonal partition, which 
uses convex polygons rather than just rectangles. We present two methods for con- 
structing polygonal partitions, both having guarantees on the worst-case aspect ratio 
of the constructed polygons; in particular, both methods guarantee a bound on the 
aspect ratio that is independent of the weights of the nodes. 

We also consider rectangular partitions with slack, where the areas of the rectangles 
may differ slightly from the weights of the corresponding nodes. We show that this 
makes it possible to obtain partitions with constant aspect ratio. This result generalizes 
to hyper-rectangular partitions in R d . We use these partitions with slack for embedding 
ultrametrics into d-dimensional Euclidean space: we give a polylog(A)-approximation 
algorithm for embedding n-point ultrametrics into R d with minimum distortion, where 
A denotes the spread of the metric. The previously best-known approximation ratio 
for this problem was polynomial in n. This is the first algorithm for embedding a 
non-trivial family of weighted-graph metrics into a space of constant dimension that 
achieves polylogarithmic approximation ratio. 

1 Introduction 



Hierarchical structures are commonplace in many areas. It is not surprising, therefore, 
that the visualization of hierarchical structures — in other words, of rooted trees — is one of 
the most widely studied problems in information visualization and graph drawing. In the 
weighted variant of the problem, we are given a rooted tree in which each leaf has a positive 
weight and the weight of each internal node is the sum of the weights of the leaves in its 
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subtree. One of the most successful practical algorithms for visualizing such weighted trees 
is the so-called Treemap algorithm. Treemap visualizes the given tree by constructing a 



hierarchical rectangular partition of a square, as illustrated in Figure 1(a) More precisely, 
Treemap assigns a rectangle to each node in the tree such that 

• the area of the rectangle is equal to the weight of the node; 

• the rectangles of the children of each internal node v form a partition of the rectangle 
of v. 

The Treemap algorithm was proposed by Shneiderman [T5] and its first efficient implemen- 
tation was given by Johnson and Shneiderman [3]. Treemap has been used to visualize a 
wide range of hierarchical data, including stock portfolios [10] , news items [20] , blogs [19] , 
business data [18] , tennis matches [8] , photo collections [5] , and file-system usage [T5] |21] . 
Shneiderman maintains a webpage j!4| that describes the history of his invention and gives 
an overview of applications and proposed extensions to his original idea. Below we only 
discuss the results that are directly related to our work. 

In general, there are many different rectangular partitions corresponding to a given tree. 
To obtain an effective visualization it is desirable that the aspect ratio of the rectangles be 
kept as small as possible; this way the individual rectangles are easier to distinguish and 
the areas of the rectangles are easier to estimate. Various heuristics have been proposed 
for minimizing the aspect ratio of the rectangles in the partition [SJ 1161 117] . Unfortunately, 
the aspect ratio can become arbitrarily bad if the weights have unfavorable values. For 
example, consider a tree with a root and two leaves, where the first leaf has weight 1 and 
the second has weight W. Then the optimal aspect ratio of any rectangular partition is 
unbounded as W — > oo. Hence, in order to obtain guarantees on the aspect ratio we cannot 
restrict ourselves to rectangles. This lead Balzer et al. [2] |T] to introduce Voronoi treemaps, 
which use more general regions in the partition. However, their approach is heuristic and it 
does not come with any guarantees on the aspect ratio of the produced regions. Thus the 
following natural question is still open: 

Suppose we are allowed to use arbitrary convex polygons in the partition, rather 
than just rectangles. Is it then always possible to obtain a partition that achieves 
aspect ratio independent of the weights of the nodes in the input tree? (The 
aspect ratio of a convex region A is defined as diam(A) 2 / area(A), where diam(A) 
is its diameter and area(^4) is its area.) 



Our results on hierarchical partitions. Our main result is an affirmative answer to the 
question above: we present two algorithms that, given an n-node tree of depth d, construct 
a partition into convex polygons with aspect ratio 0(poly(d, log n)). Our algorithms, which 
are described in Section [2] are very simple. They first convert the input tree into a binary 
tree, and then recursively partition the initial square region using straight-line cuts. The 
methods differ in the way in which the orientation of the cutting line is chosen at each step. 
The greedy method minimizes the maximum aspect ratio of two subpolygons resulting from 
the cut, while the angular method maximizes the angle that the splitting line makes with 
any of the edges of the polygon being cut. Figures 1(b) and 1(c) depict partitions computed 
by our algorithms. 

The main challenge lies in the analysis of the aspect ratio achieved by the algorithms, 
which is given in Sections |3] and |H We prove that the angular method produces a partition 
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(a) A rectangular partition (b) A greedy polygonal partition 




(c) An angular polygonal partition 

Figure 1: Sample polygonal partitions 
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with aspect ratio 0(d + log n). For the greedy method we can only prove an aspect ratio of 
0((d + logn) 8 ). Since the greedy method is the most natural one, we believe this result is 
still interesting. Moreover, in the (limited) experiments we have done — see Section [5] — the 
greedy method always outperforms the angular method. Besides these two algorithms, we 
also prove a lower bound: we show in Section [5] that for certain trees and weights, any 
partition into convex polygons must have polygons with aspect ratio tt(d). 

After having studied the problem of constructing polygonal partitions, we return to 
rectangular partitions. As observed, it is in general not possible to obtain any guarantees 
on the aspect ratio of the rectangles in the partition. If, however, we are willing to let 
the area of a rectangle deviate slightly from the weight of its corresponding node then we 
can obtain bounded aspect ratio, as we show in Section |B1 More precisely, we obtain the 
following partition. Let e £ (0, 1/3). We allow that the area A of the rectangle assigned to 
every non-root node v of is shrunken by a factor of at most (1 — e) compared to its share of 
the area A' of the rectangle of the parent v'. That is, we only require that 

(1 — e) • — ■ A' <A< — • A', 

w v > W v i 

where w v and w v > are the weights of v and v', respectively. Then we show that the aspect 
ratio of every rectangle can be bounded by 1/e. We call this kind of partition a rectangular 
partition with slack. 

Application to embedding ultrametrics. A metric space M = (X, D) is a set X 

together with a symmetric distance function D:IxI-) R>o that satisfies the triangle 
inequality and f(x±, x^) = if and only if x\ = x-2,.2 Sometimes it is hard to see the structure 
in a metric space (for example, when it is just specified by a table listing all the pairwise 
distances) or to do efficient computations on it. Then it can be advantageous to embed the 
metric space M into a simpler metric space M' = (X', D') by mapping each element x £ X 
to a point f(x) £ X'. Often the space X' is taken to be M. d (for some small d) and D' is 
an £ p metric. Usually it is impossible to exactly preserve all the pairwise distances in an 
embedding. Thus the goal is to find an embedding where the distortion, defined as 

D'(f(x),f(y)) D(x,y) 
max ; : • max 



x,yex D(x,y) x, v ex D'(f(x), f(y)) ' 

is as small as possible. In general, finding embeddings with minimum distortion is hard. 
For instance, one cannot approximate minimum-distortion embeddings of general metrics 
into K d (with d > 2) to within a factor of roughly ri 1 /( 22d - 10 ) i unless p=np [11]. Hence, 
it is interesting to consider embedding special classes of metrics. We are interested in 
ultrametrics, which are metrics satisfying the following strengthened version of the triangle 
inequality: for any x,y,z £ X we have D(x,z) < ma,x{D(x,y), D(y, z)} . Equivalently, 
M = (X, D) is an ultrametric if it can be realized as the shortest-path metric over the 
leaves of a rooted edge-weighted tree such that the distance between the root and any leaf 
is the same. Badoiu, Chuzhoy, Indyk, and Sidiropoulos [7] showed that finding a minimum- 
distortion embedding of an ultrametric into R 2 is NP-complete and presented an (^(n 1 / 3 )- 
approximation algorithm for the problem. They extended the algorithm to embedding 
ultrametrics into R d , obtaining a (n^^d^-approximation. 

We show a connection between embedding ultrametrics into K d and (hyper-) rectangular 
partitions with slack and we use this connection to obtain a significant improvement over 
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the result of Badoiu et al. [7]. More precisely, using our results on (hyper-)rectangular 
partitions with slack, we obtain a polynomial-time polylog(A)-approximation algorithm for 
the problem of embedding ultrametrics into (M. d ,l2) with minimum distortion, where A is 
the spread of X . (The spread of X is defined as A = diam(X)/ min Xi!/e x D(x, y).) As long 
as the spread is polynomial in n, this is an exponential improvement. In fact, our algorithm 
is the first one that achieves sub-polynomial approximation guarantee for embedding a non- 
trivial family of graph metrics into a space of constant dimension. 

2 The two algorithms 

Before we present our algorithms, wc define the problem more formally and introduce some 
notation. Let T be a rooted tree. We say that T is properly weighted if each node v £ T 
has a positive weight w{v) that equals the sum of the weights of the children of v. Wc 
assume without loss of generality that w(root(T)) = 1. A polygonal partition for a properly 
weighted tree assigns a convex polygon P(v) to each node v £ T such that 

• the polygon P(root(7~)) is the unit square; 

• for any node v we have area(P(f)) = w(v); 

• for any node v. the polygons assigned to the children of v form a disjoint partition of 
P(y). 

Recall that the aspect ratio of a planar convex region A, denoted by asp(A), is defined as 
asp(A) := diam(A) 2 / area(A). The aspect ratio of a polygonal partition is the maximum 
aspect ratio of any of the polygons in the partition. Our goal is to show that any properly 
weighted tree admits a fat polygon partition, that is, a polygonal partition with small aspect 
ratio. 

We propose two methods for constructing fat polygonal partitions. They both start with 
transforming the input tree T into a binary tree T 7 . The nodes of T are a subset of nodes of 
T' , and two nodes are in the ancestor-descendant relation in T if and only if they are in the 
same relation in T' ■ The weights assigned to nodes of T are preserved in T 1 . Any polygonal 
partition for T' restricted to nodes from T is a polygonal partition for T. Then, for the 
binary tree T', it suffices to design a method that cuts the polygon P(y) corresponding to a 
node v into two polygons of prcspccificd areas that correspond to z/s children. Wc propose 
two such methods: the angular method and the greedy method. To achieve a polygonal 
partition for T', it suffices to recursively apply one of the cutting methods. 

The transformation into a binary tree. We transform the input tree T into a binary 
tree T' by replacing every internal node v of degree greater than two by a collection of nodes 
whose subtrees together are exactly the subtrees of v. This can be done in such a way that 
dcpth(T') = 0(depth(T) + logn) [T3]. For completeness we sketch how this transformation 
is done. 

For a node we use T v to denote the subtree rooted at and \T V \ to denote the number 
of nodes in T v . The transformation is a recursive process, starting at the root of T. Suppose 
we reach a node v. If v has degree two or less, we just recurse on the at most two children 
of v. If v has degree k > 3, we proceed as follows. Let C(v) be the set of children of v, 
and let /i £ C(v) be the child with the largest number of nodes in its subtree. We partition 
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C{u) \ {n} into two non-empty subsets C\{v) and Ciiy') such that Y^uedfv) I^mI < \7~v\/2 
and S«ec a (i') 1 7/* I < l^l/^- We create three new nodes v\,V2,v$ and modify the tree as 
shown in Figured] The weights w(vi), w^), w{vz) are set to the sum of the weights of the 
leaves in their respective subtrees. Finally, we recurse on Vi, /i, and v%. After the procedure 



has finished we have a (properly weighted) tree in which every node has degree at most two. 
We remove all degree- 1 nodes to obtain our binary tree T' ■ The depth of T' is at most 
2(depth(T) + logn), because every time we go down two levels in T' we either pass through 
an original node from T or the number of nodes in the subtree halves. 

Methods for cutting a polygon. Suppose we have to cut a convex polygon P{y) into 
two subpolygons. Note that if we fix an orientation for the cut, then there are only two 
choices left for the cut because the areas of the subpolygons are prespecified. (The two 
choices correspond to having the smaller of the two areas to the left or to the right of the 
cut.) Our cutting methods are the following. 

• Angular: Let c denote the cut, that is, the line segment separating the two subpoly- 
gons of P(v)- We select the orientation of c such that we maximize min{angle(c, e) : 
e is an edge of the input polygon}, where angle(c, e) is the smaller of the angles be- 
tween the lines £(c) and £(e) containing c and e, respectively. In other words, we cut 
in a direction as different as possible from all the orientations determined by the edges 
of the polygon. We then take any of the two cuts of the selected orientation. 

• Greedy: The greedy method selects the cut that minimizes the maximum of the aspect 
ratios of the two subpolygons. 

Experiments. To get an idea of the relative performance of the two methods we imple- 
mented them and performed some experiments. Table[T]shows the results on two hierarchies. 
One is synthetic and was generated using a random process, the other is the home directory 
with all subfolders of one of the authors. Leaves in the latter hierarchy are the files in any of 
the folders, and the weight of a leaf is the size of the corresponding file. For comparison, we 
also ran two additional partitioning methods. Both of these methods use the transformation 
of the input into a binary tree. The random method always makes a cut in a random direc- 
tion. In the greedy rectangular method, all polygons are rectangles and all cuts are parallel 
to the sides of the original rectangle. The method always greedily chooses the cut that is 
perpendicular to the longer side, which maximizes the aspect ratio of the two subpolygons 
resulting from the cut. 
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Figure 2: Transforming a tree to a binary tree. 
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Method 


Synthetic Data 


Home Folder 


Average 


Maximum 


Average 


Maximum 


Angular 


3.79 


13.19 


3.87 


20.11 


Greedy 


2.56 


6.79 


2.57 


8.39 


Random 


5.79 


355.14 


6.26 


1609.66 


Greedy Rectangular 


3.52 


1445.99 


24.49 


230308.30 



Table 1: Aspect ratios of partitions generated by various methods 



In all our tests, the methods partitioned a square. The greedy method performs best, 
closely followed by the angular method. Interestingly, the greedy rectangular method per- 
forms even worse than the random method — apparently restricting to rectangles is a very 
bad idea as far as aspect ratio is concerned. 

In the next two sections we prove that both the angular method and the greedy method 
construct a partition in which the aspect ratios are (9(poly(dcpthT+logn)). For the angular 
method, the proof is simpler and gives a better bound on the worst-case aspect ratio. We 
present the more complicated proof for the greedy method because it is the most natural 
method and it has the best performance in practice. 

3 Analysis of angular partitions 

The idea behind the angular partitioning method is that a polygon with large aspect ratio 
must have two edges that are almost parallel. Hence, if we avoid using partition lines 
whose orientations are too close to each other, then we can control the aspect ratio of our 
subpolygons. Next we make this idea precise. 

Let U be the initial unit square that we partition,, and let <f> > be a parameter. Recall 
that for two line segments e and e', we use angle(e, e') to denote the smaller angle defined by 
the lines £{e) and £(e') containing e and e', respectively. We define a convex polygon P C U 
to be a (j)-separated polygon if it satisfies the following condition. For any two distinct edges 
e, e' we have: 

(i) angle(e, e') > </>; or 

(ii) e is contained in IPs top edge and e' is contained in IPs bottom edge (or vice versa); 
or 

(iii) e is contained in IPs left edge and e' is contained in C/'s right edge (or vice versa). 
Lemma 1. The aspect ratio of a (f>- separated polygon P is 0(l/(f>). 

Proof. Let d := diam(P) and let uv be a diagonal of P that has length d. Consider the 
bounding box B of P that has two edges parallel to uv. We call the edge of B parallel to 
and above uv its top edge, and the edge of B parallel to and below uv its bottom edge. Let r 
be a vertex of P on the top edge of B and let s be a vertex on its bottom edge — see Fig. [3] 
Let ei and e2 be the edges of P incident to r, and let e-$ and e\ be the edges incident to s. 
Let the angles a±, . . . , be defined as in Fig. [3] We distinguish two cases. 
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Figure 3: Illustration for the proof of Lemma [TJ 



• Case (a): none of e\, e2, e 3 , e 4 are parallel. 

By condition (i), this implies that the angles any two edges make is at least 0. Hence, 
we have 

< angle(ei,e 3 ) < max(ai,a 3 ), (1) 

< angle(e 2 ,e 4 ) < max(a 2 ,a 4 ), (2) 

< angle(ei, e 4 ) < ai + a4, (3) 

< angle(e 2 ,e 3 ) < a 2 + a 3 . (4) 

By (lU) we have ol\ + a 4 > 0. Now assume without loss of generality that ot\ > 0/2. 
If a 2 > 0/2 as well, then 

area(wvr) > (d 2 /4) ■ sin(</>/2). 
Since uvr C P, this implies that 

If q;2 < 0/2, then we use @ and to conclude that a 4 > <fi and a 3 > 0/2. Hence, 
we now have area(uz;s) > (rf 2 /4) • sin(0/2), which implies that asp(P) = 0(l/(f>). 

• Case (b): some edges in ei,e2,e 3 ,e4 are parallel. 

By conditions (i)-(iii), two edges of P can be parallel only if they are contained in 
opposite edges of U. Hence, \rs\ > 1. Moreover, since uv defines the diameter we 
have \uv\ > \rs\. Let a := angle(uw, rs), as illustrated in Fig. 2J If a > min{</>, 7r/4}, 
this is easily seen to imply that area(P) > area(wrws) = 0(0), which means that 
asp(P) = O(l/0). Now consider the case where a < <j> and a < it/ A. We show that this 
leads to a contradiction. Let x := uvOrs and assume without loss of generality that, as 
in Figure^ we have a = /Lrxv. Then angle(ei, e 3 ) < a < 0. By conditions (i)-(iii) this 
can only happen if e\ and e 3 are contained in opposite edges of U, if we assume that r 
and s are chosen such that they maximize the distances between r and s, which can be 
done without loss of generality. We have anglc(e 3 , rs) < anglc(itt>, rs) — a < <p < n/A. 
However, it is impossible to place two points r and s on opposite sides of U such that 
anglc(e 3 , rs) < n/A. The smallest angle one can obtain is n/A. 
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s 

Figure 4: The case of parallel edges. 



□ 

To construct a polygonal partition we use the procedure described in Section [2] Thus, 
we first transform the input tree T into a corresponding binary tree T'. Next, we recursively 
apply the angular cutting method to T 7 , that is, at each node v, we cut the polygon P{v), 
using a cut c that maximizes the minimum angle c makes with any of the edges of P(v)- 

Lemma 2. Let P{y) be the subpolygon generated by the algorithm above for a node v at 
level k in T' . Then P{v) is a (n/(2k + 6)) -separated polygon. 

Proof. The proof is by induction on k. 

For k = 0, we have v = root(T') and P{v) is a unit square. Hence, P{v) is (7r/2)- 
scparated and therefore also (7r/6)-scparated. 

For k > we argue as follows. By the induction hypothesis, the polygon P(n) corre- 
sponding to the parent fx of v is (7r/(2fc + 4))-separated. Moreover, by construction it has 
at most 4 + (k — 1) = k + 3 edges. Consider the sorted (circular) sequence of angles that 
these edges make with the £-axis. By the pigeon-hole principle, there must be two adjacent 
angles that are at least n/(k + 3) apart. Hence, the cut c that is chosen to partition P(/x) 
makes an angle at least ir/(2k + 6) with all edges of P(n). □ 

Recall that the depth of the binary tree T' is 0(depth(T) + logn). Hence, we get the 
following theorem. 

Theorem 1. Let T be a properly weighted tree with n nodes. Then the angular partitioning 
method constructs a polygonal partition for T whose aspect ratio is 0(depth(T) + logrt). 

4 Analysis of greedy partitions 

We now turn our attention to the greedy method, which at each step chooses a cut c that 
minimizes the maximum aspect ratio of the two subpolygons resulting from the cut. The 
main component of our proof that polygonal partitions with good properties exist will be 
the following lemma. It shows that there is always a way to cut a polygon into two smaller 
polygons of required areas so that the aspect ratios of the new subpolygons are bounded. 
The proof of the lemma is long and consists of a case analysis. 

Lemma 3 (Good cuts). Let P C M. 2 be a convex polygon with k vertices, and let a £ (0, 1/2]. 
Then P can be partitioned into two convex polygons Pi and P2 such that 



9 



• Each of the P\ and P 2 has at most k + 1 vertices. 

• area(Pi) = a ■ arca(P) ; and area(P2) = (1 — a) ■ arca(P). 

• max{asp(Pi), asp(p2)} < max{asp(P) (l + |) ,fc 8 }. 

Proof. We distinguish two cases, depending on whether a < 1/k 2 (that is, we are cutting 
off a relatively small subpolygon) or not. 

Case 1: a < 1/k 2 . Let <j> be the smallest angle of P, and let v be a vertex of P whose 
interior angle is <j>. Since P has k vertices, we have 

*<*(l-f 

Let £ be the angular bisector at v. Consider the cut c orthogonal to I such that 
area(Pi) = a-area(P), where P\ is the subpolygon induced by c having v as a vertex — 



see Figure 5(a) for an illustration. Let P2 be the other subpolygon. Clearly, Pi and P2 
are convex polygons of the required area with at most k + 1 vertices each. Therefore, 
it remains to bound the aspect ratios of Pi and P2. 

Since P2 C P, we have 

as P (P 2 ) = < n dia " (P)2 rp , =^< asp(P)(l + 2 a ) 

area(P2j (1 — a) ■ arca(P) 1— a 

< asp(P)(\ + -^ < asp(P)(^l + i 

We next bound asp(Pi). Let xi,x 2 be the two cndpoints of the cut c, and let t be 
the distance between x\ and Let h be the distance from v to c. We distinguish 
between two subcases. 

Case 1.1: t > h/k 2 . Since P is convex, the triangle vxix 2 is contained in Pi. 
Therefore, 

area(Pi) >h-t/2> h 2 /(2k 2 ). 

On the other hand, since c is normal to the bisector of the angle of v, it follows 
that Pi is contained inside a rectangle of width h and height Pf , with 

H < 2 • h ■ tan(0/2) < 2 • h ■ tan < 2-h/ tan(?r/fc) < 2-h- 

Thus, diam(Pi) < + 2 ■ fc/71-). It follows that 

_ diam(Pi) 2 (h + 2-fe-fc/Tr) 2 5 
a&P(A) " area(Pi) < hVC^ 5 ) ' 

Case 1.2: t < h/k 2 . Let l\ be the line passing through v and xi, and let l 2 be the 
line passing through v and X2. Let 7 be the angle between t\ and £2- Observe 
that P2 is contained between l\ and £2- Therefore, if u is the point in P2 farthest 
away from v we have 

— tt\uv\ 2 > area(P 2 ), 
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Figure 5: Partitioning P into Pi and P2 when a < 1/fc 2 



where \uv\ denotes the length of the segment uv. It follows that 



diam(P) 2 > \uv\ 2 > -(1 - a) ■ area(P). 



Therefore, 



diam P 2 2 , 

asp P = )— '— > - (1 

PV ; area (P) ~ 7 v 



We now give an upper bound on the diameter of Pi. Assume without loss of 
generality that \vx2\ > \vxi\. Consider a segment yij/2 parallel to X1X2 with 



2/1,2/2 € <9P such that yij/2 lies between and v — see Figure 5(b) Let h' be 
the distance between v and yij/2- We first argue that [yiyz] < 2t. 
Assume for the sake of contradiction that 1 2/1 2/2 1 > 2i. Let gi be the line passing 
through yi and xi, and let (72 be the line passing through 7/2 and X2. Observe 
that since 1 2/1 2/2 1 > l^i^l, the lines g\ and 52 intersect in a point to such that 
P2 is contained in the triangle x\x?,w. Furthermore, the polygon vy\X\X2y2 is 
contained in Pi. If ft,' > h/2, then the area of the triangle vy\y2 is greater or equal 
to the area of the triangle x\X2W. Therefore, area(Pi) > area(P2), contradicting 
the fact that a < 1/fc 2 . If, on the other hand, h! < h/2, then the area of the 
quadrilateral y\X\X2y2, is greater than the area of the triangle X1X2W, again 
implying that area(Pi) > area(P2), a contradiction. Therefore, we obtain that 

\ym\ < 2t. 

It now follows that any point q G Pi is at distance at most 2t from the line 
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segment vx 2 . Moreover, we have t < h/k 2 < \vx 2 \/k 2 . Hence, 

diam(Pi) = max \qq'\ < 2t + \vx 2 \ + 2t < \vx 2 \ ( 1 + ttt 

Let x* be the point on the line segment x±x 2 that is closest to v. Since \vx 2 \ > h, 
we have 

area^) > £n\vx*\ 2 > l(\vx 2 \ - t) 2 > 1\ VX2 \ 2 (l - ^ 
Therefore, 

, x dianhPx) 2 2 (1+4/fc 2 ) 2 , , (1 + 4/fc 2 ) 2 

asp Pi = < frr^ < asp P )- 1 ' 

area(Pi) 7 1 — 1/Ar (1 — l/fc^) 2 

< asp(P) • (1 + 6/fc 2 ) 2 < asp(P) • (1 + 2/fc) 2 < asp(P) • (1 + 6/fc). 

Case 2: a > 1/fc 2 . 

Case 2.1: asp(P) < fc 6 . In this case any cut giving the two subpolygons P 1 and P 2 
the required areas works. Indeed, 

aspCPr) = < < fc 2 .as P (P)<fc* 

area(Pi) a • area(P) 

and 

as P (P 2 ) = < n d T (P)2 rp , < 2-as P (P) < 2-*- < fc 7 . 

area(P2j (1 — a) • area(P) 

Case 2.2: asp(P) > fc 6 . Pick points v\,v 2 G P, such that l^ii^l = diam(P). For each 
z G [0,diam(P)], let £(z) be a line normal to v\v 2 that is at distance z from ui 
and intersects P. Note that £{Q) contains v\ and £(diam(P)) contains v 2 . Define 
f(z) to be the length of the intersection of P with £(z). Observe that 

/•diam(P) 

area(P) = / f(z)dz 

Jz=0 

Pick Si,s 2 G [0, diam(P)], so that 

fSi />diam(P) 



r 1 r 

a • area(P) = / f(z)dz = I 

Jz=0 J z= 



f(z)dz. 

diam(P) — S2 



Let Qi be the part of P that is contained between £(0) and £(si). Similarly, let 
Q 2 be the part of P that is contained between £(diam(P) — s 2 ) and £(diam(P)). 
Clearly, both Q\ and Q 2 are convex polygons with at most fc + 1 vertices. 
First, we will show that 

area(Qi) area(Q 2 ) \ < area(P) 



si s 2 J diam(P) 
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Figure 6: Partitioning P into Pi and P2, when a > 1/k 2 : Case 2.2. 



Assume for a contradiction that area(Qi)/ Si > area(P) / diam(P) and area(Q2)/s2 > 
area(P)/ diam(P). It follows that there exist z\ <E [0, si] and Z2 <G [diam(P) — S2] 
such that f{z\) > area(P)/ diam(P) and f(z 2 ) > area(P)/diam(P). Since P 
is convex, / is a bitonic function. Therefore, for each z £ [2:1,22], we have 
f(z) > area(P)/ diam(P). It follows that 

arca(P) = arca(Q 1 ) + arca(Q 2 ) + arca(P \ (Q 1 U Q 2 )) > - — . . ■ diam(P), 

diam(P) 

a contradiction. 

We can therefore assume without loss of generality that 

area(Qi) ^ area(P) 



Si diam(P) 
Note that this implies 

Si > a ■ diam(P). 

We set Pi = Qi, and P2 = P \ Qi- It remains to bound asp(Pi) and asp(p2). 
By the convexity of P, we have 

area(P) > max f(z) ■ diam(P)/2. 

Z e[0,diam(P)] 

Since asp(P) > fc 6 , it follows that 

areaf P) 2 

max f(z) < 2-- ■ diam(P) < — ■ diam(P). 

z e[o,diam(P)] ~ diam(P) 2 K ' fc 6 y ' 

This implies that P is contained inside a rectangle with one edge of length 
diam(P) parallel to V1V2, and one edge of length A ■ diam(P) normal to v\V2- 
Thus, 

4 

diam(Pi) < si + — ■ diam(P). 
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Let <ti, (72 be the two points where £(s±) intersects d P. Let (i, C2, be the lines 
passing through V\ and a±, and Wi and 172, respectively. Let also <r[ and a' 2 be 
the points where (1 and C2, respectively, intersect f(diam(P)) — see Figure [5] 
By the convexity of P and Pi , we have 

area(Pi) > area^cr^) = ( ^^pj j -area^cr^) > ( l^j5j ) -area(P). 

Since arca(Pi) = a ■ arca(P), it follows that si < y/a ■ diam(P). Using that 
a > 1/fc 2 we can now derive 

diam(Pi) 2 . (si + 4-diam(P)/fc 6 ) 2 • diam(P) + 4 • diam(P)/fc 6 ) 2 

1 area(Pi) ~~ area(Pi) a ■ arca(P) 

diam(P) 2 / 4 \ 2 / 8 16 

< asp(P) ■ + i 
Since / is bitonic, it follows that 

min f(z) > min < max /(z), max f(x 

zG[si,diam(_P)-s 2 ] l_zG[0,si] ze[diam(P)-s 2 ,diam(P)] 



Therefore, 



area(p2) area(Pi) 



diam(P) — Sx Si 

Because P2 is contained in a rectangle with one edge of length diam(P) — si 
parallel to V\V2, and one edge of length -p- • diam(P) normal to V1V2, we have 

4 

diam(P2j < diam(P) — Si H — ~ ' diam(P). 

Putting everything together, we get 

= diam(P 2 ) 2 (diam(P).(l + 4A ;6 ) s l} 2 /1 + 4^-a 

M ; area(P 2 ) " (1 - a) • area(P) " PV J \ ^T^l 

< asp(P) ■ ^1 + 4 • < asp(P) ' ( x + ^) < as P( p ) ' (l + ^ 

< asp(P) ■ (l + i 

This concludes the proof. □ 

Now we have all the necessary tools to prove a bound on the aspect ratio of the polygonal 
partition constructed by the greedy method. 
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Theorem 2. Let T be a properly weighted tree with n nodes. Then the greedy partitioning 
method constructs a polygonal partition for T whose aspect ratio is O ( (depth (7~) + lgn) 8 ). 

Proof. Recall from Section [2] that our algorithm starts by transforming T to a binary tree T' 
of depth 0(depth(T) + log n). This is done in such a way that a polygonal partition for 
T' induces a polygonal partition for T of the same (or better) aspect ratio. We then 
recursively apply the greedy cutting strategy to T' ■ Hence, it suffices to show that a recursive 
application of the greedy strategy to a tree of depth d produces a polygonal partition of 
aspect ratio 0(d 8 ). 

Let A{i) be the worst-case aspect ratio of a polygon P{v) produced by the greedy method 
over all nodes v at depth i in the tree. Note that P(root(T)) is a square and each polygon 
at depth i has at most i + 4 vertices. By Lemma [3] we thus have 



if i = 
if i > 



A(l) ~ { max{( i + 3) 8 , + i 

We can now prove by induction that A(i) < (i + 3) 8 . Indeed, for i = this obviously holds, 
and for i > we have 

A {i) < max{( l + 3) 8 , (l + -A(i-1)} 
< max{(i + 3) 8 , (l + ifg) '(* + 2) 8 } 

(i+^)-(*+2) 8 = {(i+^Hm) 8 }-(*+ 3 ) 8 

< ^ + 3) 8 . 

Hence, A(d) < (d + 3) 8 , which finishes the proof. □ 



and 



5 A lower bound for polygonal partitions 

In the previous sections we have seen that any properly weighted tree T with n nodes admits 
a polygonal partition whose aspect ratio is (9(depth(T) + logn). In this section we prove 
this is almost tight in the worst case, by exhibiting a tree for which any polygonal partition 
has aspect ration SI (depth (T)). 

We start with an easy lower bound on the aspect ratio of a convex polygon in terms of 
its smallest angle. 

Observation 1. Let P be a convex polygon and let a be the smallest interior angle of P. 
Then the aspect ratio of P is at least 2/a. 

Proof. Let v be a vertex of P whose interior angle is a. Then P is contained in the circular 
sector with radius diam(P) and angle a whose apex is at v. This sector has area (a/2) • 
diam(P) 2 , from which the observation readily follows. □ 
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A d 



v,i 



P(A, 



Figure 7: (i) Structure of the tree for the lower-bound construction, (ii) Illustration for the 
proof of Lemma |U 



Next we show how to construct, for any given depth d, a weighted tree T of depth d 
such that any polygonal partition for T has a region with a very small angle. The lower 
bound on the aspect ratio then follows immediately from Observation [TJ 

The structure of T is depicted in Fig. [T^i). The tree T has d + 1 nodes vq, ... ,1/4 that 
form a path, and d other (leaf) nodes Ai, . . . , Xd branching off the path. The idea will be to 
choose the weights of the leaves very small, so that the only way to give the region P(Aj) 
the required area, is to cut off a small triangle from P(vi). Then we will argue that one of 
these triangles must have a small angle. 

To make this idea precise we define 

_ f 1 if i = 

Xi : ~ \ Xi-i/(2Vd) if < i < d 

and we set w(A,) := a;?_ 1 /(4d) for 1 < i < d. Note that defining the weights for Ai, . . . , Ad 
implicitly defines the weights for . . . , v& as well. 

Lemma 4. If each region created in a polygonal partition for T has aspect ratio at most d, 
then we have for < i < d, 

(i) P(Xi-i) is a triangle (if \-\ exists, that is, if i ^ 0); 

(ii) P(vi) has i + 4 sides, each of length at least Xi. 

Proof. We will prove the lemma by induction on i. For i = the lemma is obviously true, 
since P{vo) is the unit square. 

Now let i > 0. By the induction hypothesis, each side of P(z^_i) has length at least Xi_\. 
If P(Xi) would fully contain an edge of its diameter would therefore be more than 

Xi—i. But then its aspect ratio would be 

diam(P(AQ) 2 > xt^ aj_i M 
area(P(Ai)) ~ w(\i) xf^/M 

which contradicts the assumptions. Hence, P{\) is a triangle, as claimed. It also follows 
that P(yi) has i + 4 sides. It remains to show that these sides have length at least x%. 

Let st be the segment that cuts P(i>j_i) into P{vi) and P(\i), let p be the corner of 
P(vi-\) that is cut off, and let q and r be the corners of P(i^_i) adjacent to p — see Fig.JTJii). 



1G 



All sides of P(fj) except qs, st, and tr have length at least Xi-\ so they definitely have length 
at least xi . 

It is easy to see that the angles of any polygon P(vj) are at least n/ 2 — indeed, when 
a corner is cut off from some P(i/j), the two new angles appearing in P(vj + i) are larger 
than the angle at the corner that is cut off. Hence, st is the longest edge of the triangle 
pst = P(X{), and we have 



\st\ > v/arca(P(Aj)) = VMK) = \j = 7^y= = x i- 

Next we show that qs has length at least xc, the argument for tr is similar. Note that 
\ps\ < \J d ■ w(Xi), otherwise P(Aj)'s aspect ratio would be larger than d. Hence, we have 

\qs\ = \pq\ - \ps\ 

> Xi-i — \ps\ (induction hypothesis) 



> Xi-l - y/d-w(Xi) 



id 



= Xi- 
Xi— 

~ ~T 

> Xi. 

□ 

Next we show that one of the triangles P(X{) must have large aspect ratio. 

Lemma 5. If each region created in a polygonal partition for T has aspect ratio at most d, 
then there is a region P{Xi) one of whose interior angles is at most 2ir/(d + 4). 

Proof. By the previous lemma, the region P(vd) has d + 4 sides. The sum of the interior 
angles of a (d + 4)-gon is exactly (d + 2) • 2ir, so one of the interior angles of P(vd) must be 
at least 

(d + 2)-2n 4tt 

= Z7T — 



rf+4 d+4 

If the two edges meeting at some corner p of P{vd) make an angle of at least 2-k — 4ir/(d + 4) 
in P{vd), then they must make an angle of at most At: / [d + 4) in a region P(Xi) adjacent 
top. □ 

Theorem 3. For any d, there is a weighted tree T of depth d such that any polygonal 
partition for T has aspect ratio Q(d). 

Proof. Consider the tree T described above. Assume the aspect ratio of the polygonal 
partition for T is less than d — otherwise we arc done Then by Lemma [5] there is a region 
with interior angle 4ir/(d + 4). By Observation [T] this region has aspect ratio at least 
(rf + 4)/27r = n(d). □ 
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6 Partitions with slack 



Recall that a rectangular partition is a polygonal partition in which all polygons are rectan- 
gles. We show that if we allow a small distortion of the areas of the rectangles, then there 
exists a rectangular partition of small aspect ratio. In fact, this result can also be obtained 
for partitions of a hypercube in M. d . We now define more precisely which type of distortion 
we allow. 

Let T be a properly weighted tree. A rectangular partition with e-slack in M. d for T 
assigns a d-dimensional hyperrectangle R{v) to each node v £ T such that 

• the hyperrectangle i?(root(T)) is the unit hypercube in R d ; 

• for any two nodes v, \i such that /x is a child of v we have 

volTO) < vol(i?H) < vol(fl(AQ) 
w(ji) ~ w(y) ~ w(n) 

• for any node v , the hyperrectanglcs assigned to the children of v have pairwise disjoint 
interiors and are contained in R[y). 

Observe that as we go down the tree T, the volumes of the hyperrectangles can start to 
deviate more and more from their weights. However, the relative volumes of the hyperrect- 
angles of the children of a node v stay roughly the same and together they still cover R(y) 
almost entirely. 

For convenience, we will work with the rectangular aspect ratio of a <i-dimcnsional hy- 
perrectangle rather than using the aspect-ratio definition given earlier. The rectangular 
aspect ratio asp rcct (i?) of hyperrectangle R with side lengths s%, S2, . . . , sa is defined as 
asp rect (i?) := |« . It can easily be shown that for 2-dimensional rectangles, the aspect 
ratio and the rectangular aspect ratio are within a constant factor. We define the rectangu- 
lar aspect ratio of a rectangular partition as the maximum rectangular aspect ratio of any 
of the hyperrectangles in the partition. 

Our algorithm to construct a rectangular partition always cuts perpendicular to the 
longest side of a hyperrectangle R(v). Since we can shrink each of the hyperrectangles of 
the children of v by a factor 1 — e, we have some extra space to keep the aspect ratios 
under control. We will prove that this means that the longest-side-first strategy can ensure 
a rectangular aspect ratio of 1/e. The basic tool is the following lemma. 

Lemma 6. Let < e < 1/3, and let R be a hyperrectangle with asp rcct (R) < 1/e. Let 
S = {wi, . . . ,Wk} be a set of weights with y —1 u>.; = (1 — e) • vol(i?). Then there exists a 
set {R\, . . . , Rk} of pairwise disjoint hyperrectangles, each contained in R, such that for all 
1 < i < k we have vo\(Ri) = Wi and asp rect (i?i) < 1/e. 

Proof. We will prove this by induction on k. The case k = 1 is trivial, so now assume k > 1. 
For a subset S' C S, we define w(S') := Ylw-eS' Wi - Assume without loss of generality that 
W\ = maxi Wi . There are two cases. 

• If w\ < (1 — e) ■ w(S), then we can split S into subsets S' and S" such that w(S') > 
e-w(S) and w(S") > e-w(S). Indeed, if i* is the minimum index such that X)-=i w i — 
e-w(S), then J2l=i w i — (1~ £ )' W (S) since e < 1/3. Hence, setting S' = {w\, . . . , w^} 
satisfies the condition. 
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We now cut R perpendicular to its longest side into hyperrectangles R' and R" such 
that 

vol(R')=vol{R)-^y§- and vol(i?") = vol(i?) • 
Note that this implies that 

w(S') = -^.vol(i?') = (l-e)-vol(i?'). 

Since w(S') > e-w(S) and we cut perpendicular to the longest side, we have asp rect (i?') < 
1/e. Hence, by induction there exists a set of pairwisc disjoint hyperrectangles, each 
contained in R' , whose volumes are the weights in S' and whose aspect ratios are at 
most 1/e. Similarly, there is a collection of suitable hyperrectangles contained in R" 
for the weights in S". Hence, we have found a set of hyperrectangles for S satisfying 
all the conditions. 

If wi > (1 — e) -w(S) we split R into R\ and R' by cutting perpendicular to the longest 
side, such that vol(i?i) = w\. Since 

wi > (1 - e) ■ w{S) = (1 - ef ■ vol(i?) > e • vol(iJ), 

we have asp rect (i?i) < 1/e. Furthermore, since w(S) = (1 — e) ■ vol(i?) we certainly 
have tui < (1 — e) ■ vol(ii), and so 

vol(iJ') = vol(iJ) -W X >£- vol(R). 

This implies that asp rect (i?') < 1/e. Finally, we note that 

w(S\{wi}) w(S)-wi w(S) 

< 



vol(R') vol(i?) - wi vol(i?) 

Hence, we can slightly shrink R' such that w(S\ {wi}) = (1 — e) ■ vol(i?'), which means 
we can apply the induction hypothesis to obtain a suitable set of hyperrectangles for 
the weights in S\ {wi} inside R'. Together with the hyperrectangle R\ for w\, this 
gives us a collection of hyperrectangles for the weights in S satisfying all the conditions. 

□ 

We can now prove our final result on partitions with slack. 

Theorem 4. Let < e < 1/3, and let d > 2. Let T be a properly weighted tree. Then there 
exists a rectangular partition with e-slack in ~BL d for T whose aspect ratio is at most 1/e. 

Proof. We construct the rectangular partition recursively, as follows. We start at the root of 
T, where we set i?(root(T)) to the unit hypercube. Note that the rectangular aspect ratio 
of a hypercube is 1. Now, given a hyperrectangle R{v) of an internal node v such that the 
rectangular aspect ratio is at most 1/e, we will construct hyperrectangles for the children 
of v of the required aspect ratio. To this end, for any child fj, of v we define 

*/ \ ti \ i \ vol(R(i>)) 
w*(ji) := (1 - e) • w(ji) ■ \ \ . 
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Observe that since J2fi w (f Ji ) = w { v )i we have 

5>*00 = (l-e)-vol(ii(i/)). 

Hence, we can apply Lemma [S] with S — {w*(fi) : (i is a child of v} to partition into 
pairwise disjoint hypcrrectangles R^, each contained in R(v) and of aspect ratio at most 1/e. 
The volume of each R(^) will be w*(^i), which is in the allowed range. Finally, we recurse 
on each non-leaf child. After we the recursive process has finished, we have the desired 
partition. □ 

7 Embedding ultrametrics into R d 

Before we describe our algorithm for embedding ultrametrics into R d , we define a -hierarchical 
well separated trees, or a-HSTs for short, introduced by Bartal [3]. Let a > 1 be a parame- 
ter. An a-HST is a rooted tree T with all leaves on the same level, where each node v has 
an associated label l(v) such that for any node v with parent fi we have = a ■ l{v). The 
metric space that corresponds to the HST T is defined on the leaves of T, and the distance 
between any two leaves in T is equal to the label of the their nearest common ancestor. 

Let M = (X, D) be the given ultrametric. After scaling M, we can assume that the 
minimum distance is 1 and the diameter is A. For any a > 1, we can embed M into an 
a-HST, with distortion a [3]. Given M, we initially compute an embedding of M into a 
2-HST T with distortion 2. Let Mj- = (X, D-y) be the metric space corresponding to T. 
Any embedding of Mj- into M d with distortion c', yields an embedding of M into H d with 
distortion O(c'). It therefore suffices to embed of Mj- into M. d . With a slight abuse of 
notation we will from now on denote the leaf of T corresponding to a point x £ X simply 
by x. 

A lower bound. To prove the approximation ratio of our embedding algorithm, we need a 
lower bound on the optimal distortion. We will use the lower bound proved by Badoiu et al. 
[7], which we describe next. 

Consider an embedding </> of Mj- into M. d . Assume without loss of generality that 4> is 
non-contracting, that is, </> does not make any distances smaller. For each node v £ T we 
define a set A v C K d as follows. Let Bd{r) denote the ball in K d centered at the origin and 
of radius r. For a leaf v of T, let A v be equal to the ball Bd(^) translated so that its center 
is 4>(v). For a non-leaf node v with children /ii, . . . let A v be the Minkowski sum of 
Ut=i w ith Bd{l{v)). By the non-contraction of <j) it follows that for each pair of nodes 
v, v' that are on the same level of T, the regions A v and A u > have disjoint interiors — see 
Figure [5] for an example. Intuitively, the volume A v is necessary to embed all the points 
corresponding to the leaves below v such that the embedding is non-contracting. This in 
turn can be used via an isoperimetric argument to obtain a lower bound on the maximum 
distance (and, hence, distortion) between the images of these leaves. 

We cannot compute A u exactly, however, since we do not know the embedding. Hence, 
following Badoiu et al. [7] we define for each v £ T a value A*{y) that estimates the volume 
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A V3 



A V2 



A Vx 



Figure 8: The sets A v for a non-contracting embedding of an HST. 



of A v , as follows. 

vol (-Bd(i)J if v is a leaf 

A*(v) = { / / f u ^\VA d 

E M is a child of v \A\^ ld + vol [B d (^)) J otherwise. 

The estimate A*(y) can be used to obtain a lower bound on the distortion, as made 
precise in the next lemma. For any V > 0, let rd(V) be the radius of a d-dimensional ball 
with volume V; thus we have 

where T(z) = J^t^e^dt. 

Lemma 7 ([7], Corollary 1). Let OPT denote the distortion of an optimal embedding of M-j- 
intoR d . Then 

rd{A*{v)) 1 

OPT > max — ■ 1. 

v is internal node of T *\ v ) 

The algorithm. We are now ready to describe the embedding / of Mj- into K d . The 
intuition behind our algorithm is as follows. The lower bound given by Lemma [7] implies 
that an embedding is nearly-optimal if it results in sets A v with small aspect ratio. Our 
approach, however, is essentially reversed. We first compute a hyperrectangular partition 
of M. d into hyperrectangles with small aspect ratio. The hyperrectangle computed for the 
leaves in T will roughly correspond to the balls around the embedded points x <G X, so 
given the partition we will be able to obtain the embedding by placing f(x) at the center 
of the hyperrectangle computed for x. The weights we use for the nodes of T will be the 
values A*{y), which using Lemma [7] can then be used to bound the distortion. 
More precisely, the algorithm works as follows. 

1. Compute for each node v £ T the value A*. 

2. Compute a hyperrectangular partition with slack e := min(i, lo * A ) . for T, where A* 
is used as the weight of a node v. Note that T is not properly weighted: the weight of 
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root(T) will be greater than 1 and the weight of an internal node v is larger than the 
sum of the weights of its children. However, we can still apply Theorem [4] Indeed, by 
scaling the weights appropriately we can ensure that the root has weight 1, and the 
fact that the weight of an internal node v is larger than the sum of the weights of its 
children only makes it easier to obtain a small aspect ratio. Thus we can compute a 
hypcrrcctangular partition for T whose rectangular aspect ratio is at most log A. 

3. Let P{v) denote the hyperrectangle computed for node v £ T in Step [21 We slightly 
modify the hyperrectangles P(v), as follows. Starting from the root of T, we traverse 
all the nodes of T . When we visit an internal node v, we shrink all hyperrectangles 
of the nodes in the subtree rooted at v by a factor of 1 — 1/ log A, with the center of 
the (current) hyperrectangle P'[y) of v being the fixed point in the transformation. 
Note that P'(y) itself is not shrunk. Thus the shrinking step moves the hyperrectangles 
contained inside P'{y) away from its boundary and towards its center, thus preventing 
points in different subtrees to get too close to each other. 

4. Let P'(y) denote the hyperrectangle computed in Step [3] Then we define the embed- 
ding f(x) of a point x € X to be the center of the hyperrectangle P'(x). 

It remains to bound the distortion of /. We will need the following observation, which 
bounds the diameter of a hyperrectangle in terms of its volume and aspect ratio. 

Observation 2. Let P be a hyperrectangle in R d with aspect ratio at most a. Then, 
assuming a 

diam(P) < aV2 ■ vol(P) 1/d . 



From now on we assume that log A > \fd — 1 

Lemma 8. The expansion of f is O f • log A ■ opt 

Proof. Consider points x, y € X . Let v be the nearest common ancestor of x and y in T . We 
have Dj-(x, y) = l(v). Both f{x) and f{y) are contained in P'(y), which in turn is contained 
in P{v). Hence, diam(P(z/)) gives an upper bound on \xy\. Note that asp roct (P(^)) < log A 
and that vol(P(V)) < A*. (We do not necessarily have vol(P(^)) = A* because of the slack 
in the partition.) Hence, 

\xy\ < diam(P(i/)) 

< log A ■ v/2- vol(P(^)) 1 / d 

< logA-V2-«) 1/d 

= o(logAT d (^*)-i) 

= o(logA-( O PT + lH0,)-^) 

= o(logA-OPT-D r (x,y)-^j, 

which shows the expansion is 0(log A • opt • -^)- □ 

Lemma 9. The contraction of f is O ( Vd ■ log 2 A 



22 



Proof. Since T is a 2-HST, we have A = /(root(T)) = 2 do P th ( r ). Hence, depth(T) = log A. 
It follows that for each node v £ T, 



vol(P») = O 



1 - 



1 



log A 



log A 



vol(P(») 



n{voi{P(v))) = n 



1 - 



i 



log A 



log A 



Consider points x, y £ X , and let v be the nearest common ancestor of x and y in T . We 
will consider the following two cases for v: 

• Case 1: v is the parent of x and y in T . 

Since the minimum distance in Mj- is 1, it follows that Dq-{x, y) = 1. By construction, 
f{x) is the center of P'(x). Let t be the distance between f(x) and dP'(x). Since 
asp rect (P'(a;)) < 1/e, we have 



1 



VdhgAJ ' 



Thus, > t 



VVdlogA 



• Case 2: v is not the parent of x and y in T . 

Let \x be the child of v that lies on the path from v to x. Let t be the distance 
between x and d P'(fi). Because of the shrinking performed in Step [31 we know that 
t > (1/logA) • (s/2), where s is the length of the shortest edge of P'(fi). Since 
asp rect (P'(/i)) < we have s > e 1 " 1 ^ ■ vol(P'(/i)) 1 /< i . Hence, 



t > 



log A 2 

_l-l/<i 



^.voKP'^)) 1 ^ 



log A 
1 



loFA 
ioFA'^) 

J(!E.«) 



1 

v^7 



Vdlog 2 A 



Combining Lemmas [S] and |H] we obtain the main result of the section. 



□ 



Theorem 5. For any fixed d > 2, £/iere exists a polynomial-time, O y\/d ■ log 3 A j -approximation 
algorithm for the problem of embedding ultrametrics into M. d with minimum distortion. 
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